Safe Feature Elimination for the LASSO and Sparse Supervised Learning Problems
Authors
Abstract
We describe a fast method to eliminate features (variables) in l1-penalized least-squares regression (or LASSO) problems. The elimination of features leads to a potentially substantial reduction in running time, especially for large values of the penalty parameter. Our method is not heuristic: it only eliminates features that are guaranteed to be absent after solving the LASSO problem. The feature elimination step is easy to parallelize, since each feature can be tested for elimination independently. Moreover, the computational effort of our method is negligible compared to that of solving the LASSO problem: roughly, it is the same as a single gradient step. Our method extends the scope of existing LASSO algorithms to larger data sets that were previously out of their reach. We show how our method can be extended to general l1-penalized convex problems and present preliminary results for the Sparse Support Vector Machine and Logistic Regression problems.
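The abstract does not state the elimination test itself. As a rough illustration only, the sketch below uses the basic SAFE test as it is commonly quoted in the screening literature, assuming the objective 0.5*||Xw - y||_2^2 + lam*||w||_1: feature k is discarded when |x_k^T y| < lam - ||x_k||_2 ||y||_2 (lam_max - lam)/lam_max, with lam_max = max_k |x_k^T y|. The function name `safe_screen` and the NumPy setup are hypothetical and not taken from the paper.

```python
import numpy as np

def safe_screen(X, y, lam):
    """Boolean mask of features that the (assumed) basic SAFE test discards.

    Assumes the objective 0.5 * ||X w - y||_2^2 + lam * ||w||_1.
    """
    # One pass over the data: |X^T y| is the gradient magnitude at w = 0.
    corr = np.abs(X.T @ y)
    lam_max = corr.max()
    if lam >= lam_max:
        # At or above lam_max the all-zero vector is optimal.
        return np.ones(X.shape[1], dtype=bool)
    col_norms = np.linalg.norm(X, axis=0)
    # Each feature's threshold depends only on its own column and shared scalars,
    # so the tests are independent and could be run in parallel.
    threshold = lam - col_norms * np.linalg.norm(y) * (lam_max - lam) / lam_max
    return corr < threshold

# Tiny orthonormal design, where the LASSO solution is soft-thresholding of X^T y.
X = np.eye(2)
y = np.array([1.0, 0.1])
mask = safe_screen(X, y, lam=0.9)
```

In this toy case the second feature is flagged for removal and the first is kept, which agrees with the soft-thresholded solution. The mask is only guaranteed to be safe under the stated objective scaling; a different normalization of the loss would change the thresholds.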
Similar resources
Mammalian Eye Gene Expression Using Support Vector Regression to Evaluate a Strategy for Detecting Human Eye Disease
Background and purpose: Machine learning is a class of modern and strong tools that can solve many important problems that nowadays humans may be faced with. Support vector regression (SVR) is a way to build a regression model which is an incredible member of the machine learning family. SVR has been proven to be an effective tool in real-value function estimation. As a supervised-learning appr...
Safe Feature Elimination in Sparse Supervised Learning
We investigate fast methods for quickly eliminating variables (features) in supervised learning problems involving a convex loss function and an l1-norm penalty, leading to a potentially substantial reduction in the number of variables prior to running the supervised learning algorithm. The methods are not heuristic: they only eliminate features that are guaranteed to be absent after sol...
Topics in Large-scale Sparse Estimation and Control
Topics in Large-Scale Sparse Estimation and Control, by Tarek Sami Rabbani. Doctor of Philosophy in Engineering-Mechanical Engineering and the Designated Emphasis in Computational Science and Engineering, University of California, Berkeley. Professor Laurent El Ghaoui, Chair. In this thesis, we study two topics related to large-scale sparse estimation and control. In the first topic, we describe a m...
Safe Screening with Variational Inequalities and Its Application to Lasso
Sparse learning techniques have been routinely used for feature selection as the resulting model usually has a small number of non-zero entries. Safe screening, which eliminates the features that are guaranteed to have zero coefficients for a certain value of the regularization parameter, is a technique for improving the computational efficiency. Safe screening is gaining increasing attention s...
From safe screening rules to working sets for faster Lasso-type solvers
Convex sparsity-promoting regularizations are ubiquitous in modern statistical learning. By construction, they yield solutions with few non-zero coefficients, which correspond to saturated constraints in the dual optimization formulation. Working set (WS) strategies are generic optimization techniques that consist in solving simpler problems that only consider a subset of constraints, whose ind...